Designing a Procurement Mechanism Based on Q-Learning with an Action-Selection Policy Based on the PSO Algorithm

Authors

Zohreh Kaheh

Reza Baradaran Kazemzadeh

Abstract

In this paper, the tender problem faced by an automobile company in procuring needed items from potential suppliers is solved with the Q-learning algorithm. The purchaser allocates orders for the needed parts among the suppliers based on the proposals received from them, which include price and delivery time. The buyer's objective is to minimize procurement costs by learning from previous tenders. We model this problem as a Markov decision problem in which each action depends on the last state and the last action. To solve it, a reinforcement learning algorithm (Q-learning) is developed in which the particle swarm optimization (PSO) algorithm serves as the action-selection policy. Compared with Q-learning under a greedy action-selection policy, the proposed algorithm is more effective and efficient.
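The idea described in the abstract, Q-learning in which a PSO search replaces the greedy rule for choosing an action, can be illustrated with a minimal sketch. Everything specific below is an assumption made for illustration, not the authors' formulation: the two-supplier toy tender in `simulate_tender`, the state (which supplier bid lower in the previous round), the action (tenths of the order given to supplier 0), the delivery-time penalty, and all parameter values.

```python
import random


def pso_select_action(q_row, n_particles=8, n_iters=15,
                      w=0.6, c1=1.5, c2=1.5, rng=random):
    """Pick an action index by running a 1-D PSO over one state's Q-values."""
    hi = len(q_row) - 1

    def fitness(x):
        # Particles move in continuous space; round to the nearest action index.
        return q_row[int(round(min(max(x, 0.0), hi)))]

    pos = [rng.uniform(0, hi) for _ in range(n_particles)]
    vel = [0.0] * n_particles
    pbest = pos[:]                      # best position seen by each particle
    gbest = max(pos, key=fitness)       # best position seen by the swarm
    for _ in range(n_iters):
        for i in range(n_particles):
            r1, r2 = rng.random(), rng.random()
            vel[i] = (w * vel[i]
                      + c1 * r1 * (pbest[i] - pos[i])
                      + c2 * r2 * (gbest - pos[i]))
            pos[i] = min(max(pos[i] + vel[i], 0.0), hi)
            if fitness(pos[i]) > fitness(pbest[i]):
                pbest[i] = pos[i]
            if fitness(pos[i]) > fitness(gbest):
                gbest = pos[i]
    return int(round(gbest))


def q_update(q, s, a, reward, s_next, alpha=0.1, gamma=0.9):
    """Standard one-step Q-learning update on a table q[state][action]."""
    q[s][a] += alpha * (reward + gamma * max(q[s_next]) - q[s][a])


def simulate_tender(state, action, rng):
    """Hypothetical tender: returns (reward, next_state) for one round."""
    prices = [rng.uniform(90, 110), rng.uniform(90, 110)]
    prices[state] -= 10                 # last round's cheaper supplier tends to stay cheaper
    delivery = [rng.uniform(1, 5), rng.uniform(1, 5)]
    share0 = action / 10                # fraction of the order given to supplier 0
    shares = [share0, 1 - share0]
    # Assumed cost: price plus a penalty of 2.0 per unit of delivery time.
    cost = sum(sh * (p + 2.0 * d) for sh, p, d in zip(shares, prices, delivery))
    next_state = 0 if prices[0] <= prices[1] else 1
    return -cost, next_state            # reward is the negative procurement cost


def train(n_rounds=2000, n_actions=11, seed=0):
    rng = random.Random(seed)
    q = [[0.0] * n_actions for _ in range(2)]
    state = 0
    for _ in range(n_rounds):
        action = pso_select_action(q[state], rng=rng)
        reward, next_state = simulate_tender(state, action, rng)
        q_update(q, state, action, reward, next_state)
        state = next_state
    return q


if __name__ == "__main__":
    q_table = train()
    for s, row in enumerate(q_table):
        print(s, [round(v, 1) for v in row])
```

In this sketch the randomness of the swarm, together with the zero-initialized table and negative rewards, supplies exploration; a purely greedy policy would instead take `max(q[state])` directly at every round.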

Journal:

Supply Chain Management

Volume 18, Issue 51, pages 0-40
